Implementing European Portuguese Verbal Idioms in a Natural Language Processing System
نویسنده
چکیده
This paper is based on an extant lexicon-grammar of European Portuguese verbal idioms (e.g., deitar mãos à obra, literally, ̳to throw hands to the work‘, ̳to start working‘.). This a database containing about 2,400 expressions, along with all relevant information on the sentence structure, distributional constraints and transformational properties of these frozen sentences. In this paper, we present a solution to the integration of verbal idioms in a fully-fledged natural language processing system, and a preliminary evaluation using a small, manually annotated corpus.
منابع مشابه
COST Action IC1207 PARSEME meeting
Dealing with idioms in Natural Language Processing systems is difficult, among other reasons, because their architecture must be conceived in such a way that it should not preclude the processing of both free word combinations and these, more constraint, expressions. On the other hand, many idioms do have syntactic structure, and can undergo several types of formal variation, thus making them h...
متن کاملIntegrating Verbal Idioms into an NLP System
This paper describes the integration of verbal idioms into an Natural Language Processing (NLP) system, adopting a construction approach, which is based on the prior parsing stage, so that these MultiWord Expressions (MWE) can be taken into account in subsequent tasks, such as semantic role labeling or whole-part relation extraction. The paper focuses on body-part nouns, which are often part of...
متن کاملVery Strict Selectional Restrictions: A Comparison Between Portuguese and French
We discuss the characteristics and behaviour of two parallel classes of verbs in two Romance languages, French and Portuguese. Examples of these verbs are Port. abater [gado] and Fr. abattre [bétail], both meaning ‘slaughter [cattle]’. Such collocations are intermediate cases between verbal idioms and largely free verb phrases. Precise knowledge of these verbs would aid recognition of verb sens...
متن کاملAuxiliary Verbs and Verbal Chains in European Portuguese
This paper describes auxiliary verb constructions in European Portuguese in view of their correct parsing in a fully integrated NLP chain. The paper provides data on these constructions over a large-sized corpus and evaluates the parsing system performance.
متن کاملVery strict selectional restrictions
We discuss the characteristics and behaviour of two parallel classes of verbs in two Romance languages, French and Portuguese. Examples of these verbs are Port. abater [gado] and Fr. abattre [bétail], both meaning ‘slaughter [cattle]’. Such collocations are intermediate cases between verbal idioms and largely free verb phrases. Precise knowledge of these verbs would aid recognition of verb sens...
متن کامل